Sliding Suffix Tree
نویسندگان
چکیده
We consider a sliding window W over a stream of characters from some alphabet of constant size. The user wants to perform deterministic substring matching on the current sliding window content and obtain positions of the matches. We present an indexed version of the sliding window based on a suffix tree. The data structure of size Θ(|W |) has optimal time queries Θ(m + occ) and amortized constant time updates, where m is the length of the query string and occ is the number of its occurrences. 1998 ACM Subject Classification F.2.2 Nonnumerical Algorithms and Problems, E.1 DATA STRUCTURES, F.1.2 Modes of Computation
منابع مشابه
Suffix Tree for a Sliding Window: An Overview
The suffix tree is a very powerful data structure developed originally for string matching and string searching. It has found many applications over the time and some of them belong into the data compression field. Many of these applications need a suffix tree built for a sliding window and there exist two clever algorithms by Fiala and Greene and by Larsson that make this possible. However, as...
متن کاملCompact Directed Acyclic Word Graphs for a Sliding Window
The suffix tree is a well-known and widely-studied data structure that is highly useful for string matching. The suffix tree of a string w can be constructed in O(n) time and space, where n denotes the length of w. Larsson achieved an efficient algorithm to maintain a suffix tree for a sliding window. It contributes to prediction by partial matching (PPM) style statistical data compression sche...
متن کاملAttack of the Mutant Suffix Trees
This is a thesis for the degree of filosofie licentiat (a Swedish degree between Master of Science and Ph.D.). It comprises three articles, all treating variations and augmentations of suffix trees, and the capability of the suffix tree data structure to efficiently capture similarities between different parts of a string. The presented applications are in the areas of data compression and patt...
متن کاملCompact Suffix Trees Resemble PATRICIA Tries: Limiting Distribution of the Depth
Suffix trees are the most frequently used data structures in algorithms on words. In this paper, we consider the depth of a compact suffix tree, also known as the PAT tree, under some simple probabilistic assumptions. For a biased memoryless source, we prove that the limiting distribution for the depth in a PAT tree is the same as the limiting distribution for the depth in a PATRICIA trie, even...
متن کاملMost Recent Match Queries in On-Line Suffix Trees
A suffix tree is able to efficiently locate a pattern in an indexed string, but not in general the most recent copy of the pattern in an online stream, which is desirable in some applications. We study the most general version of the problem of locating a most recent match: supporting queries for arbitrary patterns, at each step of processing an online stream. We present augmentations to Ukkone...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1801.07449 شماره
صفحات -
تاریخ انتشار 2018